IRIX Base Documentation 2002 November

home *** CD-ROM | disk | FTP | other *** search

/ IRIX Base Documentation 2002 November / SGI IRIX Base Documentation 2002 November.iso / usr / share / catman / p_man / cat3 / SCSL / intro_fft.z / intro_fft

Wrap

Text File | 2002-10-03 | 32.6 KB | 661 lines

IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) NNNNAAAAMMMMEEEE IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT - Introduction to signal processing routines IIIIMMMMPPPPLLLLEEEEMMMMEEEENNNNTTTTAAAATTTTIIIIOOOONNNN See individual man pages for operating system and hardware availability DDDDEEEESSSSCCCCRRRRIIIIPPPPTTTTIIIIOOOONNNN The signal processing routines currently consist of Fast Fourier Transform (FFT) routines, convolution routines, and correlation routines. The following data types are used in these routines: * Single precision: Fortran "real" data type, C/C++ "float" data type, 32-bit floating point; these routine names begin with SSSS. * Single precision complex: Fortran "complex" data type, C/C++ "scsl_complex" data type (defined in <<<<ssssccccssssllll____fffffffftttt....hhhh>>>>), C++ STL "complex<float>" data type (defined in <<<<ccccoooommmmpppplllleeeexxxx....hhhh>, two 32-bit floating point reals; these routine names begin with CCCC. * Double precision: Fortran "double precision" data type, C/C++ "double" data type, 64-bit floating point; these routine names begin with DDDD. * Double precision complex: Fortran "double complex" data type, C/C++ "scsl_zomplex" data type (defined in <<<<ssssccccssssllll____fffffffftttt....hhhh>>>>), C++ STL "complex<double>" data type (defined in <<<<ccccoooommmmpppplllleeeexxxx....hhhh>>>>), two 64-bit floating point doubles; these routine names begin with ZZZZ. NOTE: when using the C++ Standard Template Library (STL) to define complex types, the include files must be used in the following order: #include <complex.h> #include <scsl_fft.h> Often little or no difference exists between these versions, other than the data types of some inputs and outputs. In this case, the routines are described on the same man page, and that man page is named after the real or complex routine. The mmmmaaaannnn(1) command can find a man page online by either the real, complex, double precision, or double complex name. The data types for the _s_c_a_l_e, _t_a_b_l_e, and _w_o_r_k arguments in these routines vary, depending on the function which is called. In the CCCCCCCC, SSSSCCCC, and CCCCSSSS routines, the arguments are single precision. In the ZZZZZZZZ, DDDDZZZZ and ZZZZDDDD routines, the arguments are double precision. By default, the integer arguments are 4 bytes (32 bits) in size; this is the size obtained when one links to the SCSL library with ----llllssssccccssss or ----llllssssccccssss____mmmmpppp. Another version of SCSL is available, however, in which PPPPaaaaggggeeee 1111 IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) integers are 8 bytes (64 bits). This version allows the user access to larger memory sizes and helps when porting legacy Cray codes. It can be loaded by using either the ----llllssssccccssss____iiii8888 or ----llllssssccccssss____iiii8888____mmmmpppp link option. Note that any program may use only one of the two versions; 4-byte integer and 8-byte integer library calls cannot be mixed. C/C++ function prototypes for the signal processing routines are provided in <<<<ssssccccssssllll____fffffffftttt....hhhh>>>>, when using the default 4-byte integers, and <<<<ssssccccssssllll____fffffffftttt____iiii8888....hhhh>>>>, when using 8-byte integers. These header files define the complex types ssssccccssssllll____ccccoooommmmpppplllleeeexxxx and ssssccccssssllll____zzzzoooommmmpppplllleeeexxxx, which are used in the prototypes. Alternatively, C++ programs may declare arguments using the types ccccoooommmmpppplllleeeexxxx<<<<ffffllllooooaaaatttt>>>> and ccccoooommmmpppplllleeeexxxx<<<<ddddoooouuuubbbblllleeee>>>> from the standard template library. But if these types are used, <<<<ccccoooommmmpppplllleeeexxxx....hhhh>>>> must be included before <<<<ssssccccssssllll____fffffffftttt....hhhh>>>> (or <<<<ssssccccssssllll____fffffffftttt____iiii8888....hhhh>>>>). Note, though, that both complex types are equivalent: they simply represent (real, imaginary) pairs of floating point numbers stored contiguously in memory. With the proper casts, you can simply pass arrays of floating point data to the routines where complex arguments are expected. Casts, however, can be avoided. The header files <<<<ssssccccssssllll____fffffffftttt....hhhh>>>> and <<<<ssssccccssssllll____fffffffftttt____iiii8888....hhhh>>>> directly support the use of user-defined complex types or disabling prototype checking for complex arguments completely. By defining the symbol SSSSCCCCSSSSLLLL____VVVVOOOOIIIIDDDD____AAAARRRRGGGGSSSS before including <<<<ssssccccssssllll____fffffffftttt....hhhh>>>> or <<<<ssssccccssssllll____fffffffftttt____iiii8888....hhhh>>>> all complex arguments will be prototyped as vvvvooooiiiidddd ****. To define the symbol SSSSCCCCSSSSLLLL____VVVVOOOOIIIIDDDD____AAAARRRRGGGGSSSS at compile time use the ----DDDD compiler option (i.e., ----DDDDSSSSCCCCSSSSLLLL____VVVVOOOOIIIIDDDD____AAAARRRRGGGGSSSS) or use an explicit ####ddddeeeeffffiiiinnnneeee SSSSCCCCSSSSLLLL____VVVVOOOOIIIIDDDD____AAAARRRRGGGGSSSS in the source code. This allows the use of any complex data structure without warnings from the compiler, provided the structure is as described above; that is: 1. The real and imaginary components must be contiguous in memory. 2. Sequential array elements must also be contiguous in memory. While this allows the use of non-standard complex types without generating compiler warnings, it has the disadvantage that the compiler will not catch type mismatches. Strong type checking can be enabled employing user-defined complex types instead of SCSL's standard complex types. To do this, define SSSSCCCCSSSSLLLL____UUUUSSSSEEEERRRR____CCCCOOOOMMMMPPPPLLLLEEEEXXXX____TTTT====_m_y__c_o_m_p_l_e_x and SSSSCCCCSSSSLLLL____UUUUSSSSEEEERRRR____ZZZZOOOOMMMMPPPPLLLLEEEEXXXX____TTTT====_m_y__z_o_m_p_l_e_x, where _m_y__c_o_m_p_l_e_x and _m_y__z_o_m_p_l_e_x are the names of user-defined complex types. These complex types must be defined before including the <<<<ssssccccssssllll____fffffffftttt....hhhh>>>> (or <<<<ssssccccssssllll____fffffffftttt____iiii8888....hhhh>>>>) header file. Fortran 90 users on IRIX systems can perform compile-time checking of SCSL FFT subroutine calls by adding UUUUSSSSEEEE SSSSCCCCSSSSLLLL____FFFFFFFFTTTT (for 4-byte integer arguments) or UUUUSSSSEEEE SSSSCCCCSSSSLLLL____FFFFFFFFTTTT____IIII8888 (for 8-byte integer arguments) to the source code from which the FFT calls are made. Alternatively, the compile-time checking can be invoked without any source code modifications by using the ----aaaauuuuttttoooo____uuuusssseeee compiler option, e.g., PPPPaaaaggggeeee 2222 IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) f90 -auto_use SCSL_FFT test.f -lscs f90 -auto_use SCSL_FFT_I8 -i8 test.f -lscs_i8 FFFFFFFFTTTT rrrroooouuuuttttiiiinnnneeeessss These routines apply to one or more FFTs. Following is a table of supported FFT routines. Each of these routines is highly optimized for single-processor use. The two-dimensional, three-dimensional, and one-dimensional multiple routines are also multitasked (multi-threaded) for all sizes for which there is a performance benefit; the one-dimensional routines are multitasked if the data size exceeds the size of the largest processor cache. Each routine can compute either a forward or an inverse Fourier transform. In this table, rows of the table represent input and output data types for the routines in each column.: * CCCC---->>>>CCCC implies 32-bit complex input and output. * ZZZZ---->>>>ZZZZ implies 64-bit double complex input and 64-bit double complex output. Each routine in this row is documented with the complex routine in the above row. * SSSS---->>>>CCCC implies 32-bit real input and 32-bit complex output. * DDDD---->>>>ZZZZ implies 64-bit double precision real input and 64-bit double precision complex output. Each routine in this row is documented with the real-to-complex routine in the above row. * CCCC---->>>>SSSS implies 32-bit complex input and 32-bit real output. * ZZZZ---->>>>DDDD implies 64-bit double complex input and 64-bit double precision output. Each routine named in this row is documented with the complex - real routine in the above row. Columns of the table represent the number of dimensions for which the FFT is calculated for the routines in each row: * One-dimensional (single) calculates one FFT in one dimension. * One-dimensional (multiple) calculates an FFT in one dimension for each column of a two-dimensional matrix. * Two-dimensional calculates one FFT in two dimensions. * Three-dimensional calculates one FFT in three dimensions. --------------------------------------------------------------------------- 1-dimensional 1-dimensional 2-dimensional 3-dimensional (single) (multiple) --------------------------------------------------------------------------- PPPPaaaaggggeeee 3333 IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) C->C CCFFT CCFFTM CCFFTMR CCFFT2D CCFFT3D Z->Z ZZFFT ZZFFTM ZZFFTMR ZZFFT2D ZZFFT3D S->C SCFFT SCFFTM SCFFT2D SCFFT3D D->Z DZFFT DZFFTM DZFFT2D DZFFT3D C->S CSFFT CSFFTM CSFFT2D CSFFT3D Z->D ZDFFT ZDFFTM ZDFFT2D ZDFFT3D --------------------------------------------------------------------------- NNNNOOOOTTTTEEEESSSS The FFT routines were designed so that they can be implemented efficiently on many different architectures. The calling sequence is the same in any implementation. Certain details, however, depend on the particular implementation. One area of difference is the size of the _t_a_b_l_e and _w_o_r_k arrays. Different systems may need different sizes. The subroutine call requires no change, but you may have to change array sizes in the DDDDIIIIMMMMEEEENNNNSSSSIIIIOOOONNNN or type statements that declare the arrays. The following are the required array sizes for the Origin series. The values of _N_R and _N_F_R are explained below: * CCCCCCCCFFFFFFFFTTTT _t_a_b_l_e: 2_n + _N_F REAL WORDS _w_o_r_k: 2_n REAL WORDS * ZZZZZZZZFFFFFFFFTTTT _t_a_b_l_e: 2_n + _N_F DBL PREC WORDS _w_o_r_k: 2_n DBL PREC WORDS * CCCCCCCCFFFFFFFFTTTTMMMMRRRR _t_a_b_l_e: 2_n + _N_F REAL WORDS _w_o_r_k: 2_n REAL WORDS * ZZZZZZZZFFFFFFFFTTTTMMMMRRRR _t_a_b_l_e: 2_n + _N_F DBL PREC WORDS _w_o_r_k: 2_n DBL PREC WORDS * CCCCCCCCFFFFFFFFTTTT2222DDDD _t_a_b_l_e: (2*_n_1+_N_F) + (2*_n_2+_N_F) REAL WORDS _w_o_r_k: 2*MMMMAAAAXXXX((((_n_1,,,,_n_2)))) REAL WORDS * ZZZZZZZZFFFFFFFFTTTT2222DDDD _t_a_b_l_e: (2*_n_1+_N_F) + (2*_n_2+_N_F) DBL PREC WORDS _w_o_r_k: 2*MMMMAAAAXXXX((((_n_1,,,,_n_2)))) DBL PREC WORDS * CCCCCCCCFFFFFFFFTTTT3333DDDD _t_a_b_l_e: (2*_n_1+_N_F) + (2*_n_2+_N_F) + (2*_n_3+_N_F) REAL WORDS _w_o_r_k: 2*MMMMAAAAXXXX((((_n_1,,,,_n_2,,,,_n_3)))) REAL WORDS PPPPaaaaggggeeee 4444 IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) * ZZZZZZZZFFFFFFFFTTTT3333DDDD _t_a_b_l_e: (2*_n_1+_N_F) + (2*_n_2+_N_F) + (2*_n_3+_N_F) DBL PREC WORDS _w_o_r_k: 2*MMMMAAAAXXXX((((_n_1,,,,_n_2,,,,_n_3)))) DBL PREC WORDS * CCCCCCCCFFFFFFFFTTTTMMMM _t_a_b_l_e: (_N_F + 2 * _n) REAL WORDS _w_o_r_k: 2_n REAL WORDS * ZZZZZZZZFFFFFFFFTTTTMMMM _t_a_b_l_e: (_N_F + 2 * _n) DBL PREC WORDS _w_o_r_k: 2_n DBL PREC WORDS * SSSSCCCCFFFFFFFFTTTT, CCCCSSSSFFFFFFFFTTTT _t_a_b_l_e: (_n+_N_F_R) REAL WORDS _w_o_r_k: _n+2 REAL WORDS * DDDDZZZZFFFFFFFFTTTT, ZZZZDDDDFFFFFFFFTTTT _t_a_b_l_e: (_n+_N_F_R) DBL PREC WORDS _w_o_r_k: _n + 2 DBL PREC WORDS * SSSSCCCCFFFFFFFFTTTT2222DDDD, CCCCSSSSFFFFFFFFTTTT2222DDDD _t_a_b_l_e: (_n+_N_F_R) + (2*_n_2+_N_F) REAL WORDS _w_o_r_k: _n_1+4*_n_2 REAL WORDS * DDDDZZZZFFFFFFFFTTTT2222DDDD, ZZZZDDDDFFFFFFFFTTTT2222DDDD _t_a_b_l_e: (_n_1+_N_F_R) + (2*_n_2+_N_F) DBL PREC WORDS _w_o_r_k: _n_1 + 4 * _n_2 DBL PREC WORDS * SSSSCCCCFFFFFFFFTTTT3333DDDD, CCCCSSSSFFFFFFFFTTTT3333DDDD _t_a_b_l_e: (_n_1+_N_F_R) + (2*_n_2+_N_F) + (_2*_n_3+_N_F) REAL WORDS _w_o_r_k: _n_1 + 4 * n3 REAL WORDS * DDDDZZZZFFFFFFFFTTTT3333DDDD, ZZZZDDDDFFFFFFFFTTTT3333DDDD _t_a_b_l_e: (_n_1+_N_F_R) + (2*_n_2+_N_F) + (2*_n_3+_N_F) DBL PREC WORDS _w_o_r_k: _n_1 + 4 * n3 DBL PREC WORDS * SSSSCCCCFFFFFFFFTTTTMMMM, CCCCSSSSFFFFFFFFTTTTMMMM _t_a_b_l_e: (_n+_N_F_R) REAL WORDS _w_o_r_k: _n + 2 REAL WORDS * DDDDZZZZFFFFFFFFTTTTMMMM, ZZZZDDDDFFFFFFFFTTTTMMMM _t_a_b_l_e: (_n+_N_F_R) DBL PREC WORDS _w_o_r_k: _n + 2 DBL PREC WORDS The second area of difference is the _i_s_y_s parameter array, an array that gives certain implementation-specific information. All features and functions of the FFT routines specific to any particular implementation are confined to this _i_s_y_s array. On any implementation, you can use the default values by using an argument value of 0. PPPPaaaaggggeeee 5555 IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) In the Origin series implementation, _i_s_y_s((((0000))))====0000 and _i_s_y_s((((0000))))=1 are supported. In SCSL versions prior to 1.3, only _i_s_y_s((((0000))))====0000 was allowed. For _i_s_y_s((((0000))))====0000, _N_F====33330000 and _N_F_R====11115555, and for _i_s_y_s((((0000))))====1111, _N_F=_N_F_R=222255556666. The _N_F(_R) words of storage in the table array contain a factorization of the length of the transform. The smaller values of _N_F and _N_F_R for _i_s_y_s((((0000))))====0000 are historical. They are too small to store all the required factors for the highest performing FFT, so when _i_s_y_s((((0000))))====0000, extra space is allocated when the table array is initialized. To avoid memory leaks, this extra space must be deallocated when the table array is no longer needed. The routines CCCCCCCCFFFFFFFFTTTTFFFF, CCCCCCCCFFFFFFFFTTTTMMMMFFFF, etc., are used to release this memory. Due to the potential for memory leaks, the use of _i_s_y_s((((0000))))====0000 should be avoided. For _i_s_y_s((((0000))))====1111, the values of _N_F and _N_F_R are large enough so that no extra memory needs to be allocated, and there is no need to call CCCCCCCCFFFFFFFFTTTTFFFF, etc. to release memory. (If called, these routines will do nothing.) NOTE: _i_s_y_s((((0000)))) ==== 1111 means that _i_s_y_s is an integer array with two elements. The second element, _i_s_y_s((((1111)))), will not be accessed. Finally, in addition to the _w_o_r_k array, the FFT routines also dynamically allocate scratch space from the stack. The amount of space allocated can be slightly bigger than the size of the largest processor cache. For single processor runs, the default stack size is large enough that these allocations generally cause no problems. But for parallel runs, you need to ensure that the stack size of slave threads is big enough to hold this scratch space. Failure to reserve sufficient stack space will cause programs to dump core due to stack overflows. The stack size of MP library slave threads is controlled via the MMMMPPPP____SSSSLLLLAAAAVVVVEEEE____SSSSTTTTAAAACCCCKKKKSSSSIIIIZZZZEEEE environment variable or the mmmmpppp____sssseeeetttt____ssssllllaaaavvvveeee____ssssttttaaaacccckkkkssssiiiizzzzeeee(((()))) library routine. See the mmmmpppp(3C), mmmmpppp(3F) and ppppeeee____eeeennnnvvvviiiirrrroooonnnn(5) reference pages for more information on controlling the slave stack size. For pthreads applications, the thread's stack size is specified as one of many creation attributes provided in the pthread_attr_t argument to pppptttthhhhrrrreeeeaaaadddd____ccccrrrreeeeaaaatttteeee(3P). The stacksize attribute should be set explicitly to a non-default value using the pppptttthhhhrrrreeeeaaaadddd____aaaattttttttrrrr____sssseeeettttssssttttaaaacccckkkkssssiiiizzzzeeee(3P) call, described in the pppptttthhhhrrrreeeeaaaadddd____aaaattttttttrrrr____iiiinnnniiiitttt(3P) man page. RRRReeeeaaaallll----ttttoooo----ccccoooommmmpppplllleeeexxxx FFFFFFFFTTTTssss In the formulas on the man pages, there are _n real input values, and _n/2+1 complex output values. This property is characteristic of real- to-complex FFTs. The mathematical definition of the Fourier transform takes a sequence of _n complex values and transforms it to another sequence of _n complex values. A complex-to-complex FFT routine, such as CCCCCCCCFFFFFFFFTTTT or CCCCCCCCFFFFFFFFTTTTMMMM, will take _n complex input values, and produce _n complex output values. In fact, one easy way to compute a real-to-complex FFT is to store the input data, _x, in a complex array, then call routine CCCCCCCCFFFFFFFFTTTT to compute the FFT. You get the same answer when using the SSSSCCCCFFFFFFFFTTTT/SSSSCCCCFFFFFFFFTTTTMMMM routine. PPPPaaaaggggeeee 6666 IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) A separate real-to-complex FFT routine is more efficient. Because the input data are real, you can make use of this fact to save almost half of the computational work. The theory of Fourier transforms tells us that for real input data, you have to compute only the first _n/2 + 1 complex output values, because the remaining values can be computed from the first half of the values by the simple formula: _Y_K,_L = conjg(_Y_n-_k,_L) _f_o_r _n/_2 <= _k <= _n-1 where the notation conjg(_Y) represents the complex conjugate of _y. In fact, in many applications, the second half of the complex output data are never explicitly computed or stored. Likewise, as explained later, only the first half of the complex data has to be supplied for the complex-to-real FFT. Another implication of FFT theory is that, for real input data, the first output value, _Y(0), will always be a real number; therefore, the imaginary part will always be 0. If _n is an even number, _Y(_n/2) will also be real and thus, have a zero imaginary part. CCCCoooommmmpppplllleeeexxxx----ttttoooo----rrrreeeeaaaallll FFFFFFFFTTTTssss Consider the complex-to-real case. The effect of the computation is given by the formulas on the man pages, but with _X complex and _Y real. In general, the FFT transforms a complex sequence into a complex sequence; however, in a certain application you may know the output sequence is real, perhaps because the complex input sequence was the transform of a real sequence. In this case, you can save about half of the computational work. According to the theory of Fourier transforms, for the output sequence, _Y, to be a real sequence, the following identity on the input sequence, _X, must be true: _X_k,_L = conjg(_X_n-_k,_L) for _n/2 <= _k <= _n-1 And, in fact, the following input values _X_k,_L for _k > _n/2 do not have to be supplied, because they can be inferred from the first half of the input. Thus, in the complex-to-real routine, CCCCSSSSFFFFFFFFTTTTMMMM, the arrays can be dimensioned as follows: Fortran: COMPLEX X(0:ldx-1, 0:lot-1) PPPPaaaaggggeeee 7777 IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) REAL Y(0:ldy-1, 0:lot-1) C/C++: scsl_complex x[lot][ldx]; float y[lot][ldy]; C++ STL: complex<float> x[lot][ldx]; float y[lot][ldy]; where _l_d_x >= _n/2 + 1, _l_d_y >= _n. In each column, there are (_n/2) + 1 complex input values and _n real output values. Even though only (_n/2) + 1 input values are supplied, the size of the transform is still _n in this case, because implicitly the FFT formula for a sequence of length _n is used. Another implication of the theory is that _X(0, _L) must be a real number (that is, must have zero imaginary part). If _n is an even number, _X(_n/2, _L) must also be real. The CCCCSSSSFFFFFFFFTTTTMMMM and CCCCSSSSFFFFFFFFTTTT routines assume that each of these values is real; if a nonzero imaginary part is given, it is ignored. IIIInnnniiiittttiiiiaaaalllliiiizzzzaaaattttiiiioooonnnn The _t_a_b_l_e array stores the trigonometric tables used in calculation of the FFT. You must initialize _t_a_b_l_e by calling the routine with _i_s_i_g_n = 0 prior to doing the transforms. If the value of the problem size, _n, does not change, _t_a_b_l_e does not have to be reinitialized. Because SSSSCCCCFFFFFFFFTTTT and CCCCSSSSFFFFFFFFTTTT use the same format for _t_a_b_l_e, either can be used to initialize it (note that CCCCCCCCFFFFFFFFTTTT uses a different table format). DDDDiiiimmmmeeeennnnssssiiiioooonnnnssss In the preceding descriptions and on the specific man pages, it is assumed that array subscripts were zero-based, as is customary in FFT applications. However, if you prefer to use the more customary Fortran style with subscripts starting at 1 you do not have to change the calling sequence. ------------------------------------------------------------------------- Routine subscripts starting at 0 subscripts starting at 1 ------------------------------------------------------------------------- CCFFT COMPLEX X(0:N-1) COMPLEX X(N) COMPLEX Y(0:N-1) COMPLEX Y(N) CCFFT2D COMPLEX X(0:ldx-1, 0:n2-1) REAL X(ldx, n2) COMPLEX Y(0:ldy-1, 0:n2-1) COMPLEX Y(ldy, n2) SCFFT REAL X(0:n-1) REAL X(n) PPPPaaaaggggeeee 8888 IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) COMPLEX Y(0:n/2) COMPLEX Y(n/2 + 1) SCFFT2D REAL X(0:ldx-1, 0:n2-1) COMPLEX X(ldx, n2) COMPLEX Y(0:ldy-1, 0:n2-1) COMPLEX Y(ldy, n2) CCFFTM COMPLEX X(0:ldx-1, 0:lot-1) COMPLEX X(ldx, lot) COMPLEX Y(0:ldy-1, 0:lot-1) COMPLEX Y(ldy, lot) CCFFTMR COMPLEX X(0:ldx-1, 0:n-1) COMPLEX X(ldx, n) COMPLEX Y(0:ldy-1, 0:n-1) COMPLEX Y(ldy, n) ------------------------------------------------------------------------- CCCCoooonnnnvvvvoooolllluuuuttttiiiioooonnnn aaaannnndddd CCCCoooorrrrrrrreeeellllaaaattttiiiioooonnnn RRRRoooouuuuttttiiiinnnneeeessss These routines feature convolution for Finite Impulse Response (FIR) as well as correlations. Each of these routines is highly optimized for single-processor use. The routines which use two-dimensional input sequences are multitasked (multi-threaded). The convolution and correlation routines are very general. To achieve this generality and maximum flexibility, one-dimensional sequences are defined by 3 parameters. Six parameters are necessary for two-dimensional sequences. One drawback of this generality are the long calling sequences. The following table contains a summary of the filter and correlation routines. Each routine has its own man page. In this table, rows of the table represent data types for the routines in each column: * CCCC implies 32-bit complex data. * ZZZZ implies 64-bit double complex data. * SSSS implies 32-bit real data. * DDDD implies 64-bit double precision real data. Columns of the table represent the type of computation as well as the number of dimensions for which the convolution or correlation is calculated for the routines in each row: * One-dimensional FIR applies a Finite Impulse Response filter to one- dimensional signals. * One-dimensional (multiple) FIR applies a Finite Impulse Response filter to multiple one-dimensional signals. * Two-dimensional FIR applies a Finite Impulse Response filter to two- dimensional signals. * One-dimensional COR calculates the correlation of one-dimensional sequences. PPPPaaaaggggeeee 9999 IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) IIIINNNNTTTTRRRROOOO____FFFFFFFFTTTT((((3333SSSS)))) * One-dimensional (multiple) COR calculates the correlation of multiple one-dimensional sequences. * Two-dimensional COR calculates the correlation of two-dimensional sequences. ------------------------------------------------------------ Type 1D (single) 1D (multiple) 2D ------------------------------------------------------------ C CFIR1D CFIRM1D CFIR2D Z ZFIR1D ZFIRM1D ZFIR2D S SFIR1D SFIRM1D SFIR2D D DFIR1D DFIRM1D DFIR2D ------------------------------------------------------------ C CCOR1D CCORM1D CCOR2D Z ZCOR1D ZCORM1D ZCOR2D S SCOR1D SCORM1D SCOR2D D DCOR1D DCORM1D DCOR2D ------------------------------------------------------------ SSSSEEEEEEEE AAAALLLLSSSSOOOO IIIINNNNTTTTRRRROOOO____SSSSCCCCSSSSLLLL(3S) PPPPaaaaggggeeee 11110000